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ABSTRACT 

With a complete, mass-selected sample of quiescent galaxies from the NEWFIRM Medium-Band 
Survey (NMBS), we study the stellar populations of the oldest and most massive galaxies (>10 11 
M ) to high redshift. The sample includes 570 quiescent galaxies selected based on their extinction- 
corrected U-V colors out to z=2.2, with accurate photometric redshifts, <j z /(1+z)~2%, and rest- 
frame colors, <tu-v ~0.06 mag. We measure an increase in the intrinsic scatter of the rest-frame 
U-V colors of quiescent galaxies with redshift. This scatter in color arises from the spread in ages 
of the quiescent galaxies, where we see both relatively quiescent red, old galaxies and quiescent blue, 
younger galaxies towards higher redshift. The trends between color and age are consistent with the 
observed composite rest-frame spectral energy distributions (SEDs) of these galaxies. The composite 
SEDs of the reddest and bluest quiescent galaxies are fundamentally different, with remarkably well- 
defined 4000 A- and Balmer-breaks, respectively. Some of the quiescent galaxies may be up to 4 
times older than the average age- and up to the age of the universe, if the assumption of solar 
metallicity is correct. By matching the scatter predicted by models that include growth of the red 
sequence by the transformation of blue galaxies to the observed intrinsic scatter, the data indicate 
that most early-type galaxies formed their stars at high redshift with a burst of star formation prior to 
migrating to the red sequence. The observed U-V color evolution with redshift is weaker than passive 
evolution predicts; possible mechanisms to slow the color evolution include increasing amounts of dust 
in quiescent galaxies towards higher redshift, red mergers at z<l, and a frosting of relatively young 
stars from star formation at later times. 

Subject headings: cosmology: observations - galaxies: evolution - galaxies: formation 



1. INTRODUCTION 

Massive galaxies with strongly suppressed star for- 
mat ion_e2ds_t L _out to and beyond a redshi f t of z ~ 2 
(e.g. lLabbe et all 120051 : iDaddi et all 120051 : iKriek et al.l 
120061 . 120091 : iFontana et all l2009f ). and these red, qui- 
escent galaxies form well-defined color-mass and color- 
mag nitude relations (C MR), known as the red sequence 
(e.g. IKriek et"aLll2008[ ). The scatter in the color of red 
sequence galaxies provides a natural marker for the vari- 
ation in star formation histories. Galaxies with stellar 
populations that formed at different epochs will have an 
intrinsic scatter in their colors, and this scatter will de- 
crease as these red sequence galaxies evolve. 

In simple models, the scatter in the observed color of 
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red sequence galaxies increases with redshift as the frac- 
tional age differences between the galaxies becomes larger 
(iBower et all 119921 : lEUis et al.lll997t Ivan Dokkum et al.l 
119981 ). In more complex models that include morpho- 
logical transformations, the scatter can be constant or 
even decrease with redshift ([van Dokkum fc FranxH20Qll : 
iRomeo et al.ll2QQ8l ). Directly probing the scatter in color 
(as well as the scatter in age) as a function of redshift 
will constrain these models. 

The intrinsic scatter in the U-V colors of local 
early-t ype galaxies in t he Coma Cluster is ~0.03 
mag (Bower et al.l 119921 ). Most of the work at 
higher redshifts has been done only in clusters, find- 
ing a roughly constant internal U-V scatter of ~ 
0.08 ± 0.03 mag at 0.05 < z < 0.8, increas- 
ing to ~ 15 at z=L6 (Ivan Dokkum etall 120001: 
iHolden et aTll 2004: Mcint osh et al . 2005b; Lid man et al. 
2008; Mei et al. 2009; Ruhlan cTeTaLl 120091 : [Hilton et al. 
120091 : iPapovich et al.ll2010f ). To date, there are no mea- 
surements of the intrinsic scatter in the CMR for field 
galaxies at z >1, because this requires accurate rest- 
frame colors and methods of separating star-forming and 
quiescent galaxies. 

At high redshift, the color-mass relation is "diluted" 
by dusty star- forming galaxies, and the dust complicates 
the measurements of the intrinsic color scatter of red, 
old galaxies. This complication is less important at low 
redsh ift because ^70-80% of red galaxies have very little 
dust (|Bell et al.ll2004at IWolf et al.ll2005f ). Furthermore, 
dusty contaminants can be visually identified using mor- 
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phological information out to z ~1 (e.g. iMdntosh et all 
l2QQ5at iRuhland et al.ll2009l ). whereas at higher redshifts 
the galaxies are too faint to resolve with current technol- 
ogy. However, these dusty systems can be identified us- 
i ng additional inform ation: e.g. UVJ rest-frame colors 
([Williams et al. |2009[). the visual d ust extinction from 



SEP modeling (Brammer et al. 200 jl), or mid-IR imaging 
(iPapovich et all 120051 ; i Franx et al] 120081 : iFontana et all 

Currently the best estimates for the ages of stel- 
lar p opulations in mass ive high redshift galaxies comes 
from iKriek et al.l ([20081 ) , who combine broadband mul- 
tiwavelength imaging with NIR spectroscopy for a K- 
selected sample. They find that red sequence galaxies at 
z ~2.3 typically have 0.5-1 Gyr populations with mod- 
erate amounts of dust. However, older galaxies may 
have been missed due to the magnitude limits of the 
if-selection. To understand the properties of quiescent 
galaxies, a large systematic study is required to connect 
and confirm the limited knowledge we have of galaxies 
at z ~2 with the observed properties of local early-type 
galaxies. 

High redshift studies of galaxies are either based on 
small, accurate spectroscopic samples that are biased due 
to the methods of selection, or large photometric samples 
limited by the accuracy of the photometric redshifts and 
depth of the survey. To address the problems associated 
with each m ethod, the NEWFIRM M edium-Band Sur- 
vey (NMBS, Ivan Dokkum et all [2009) was designed to 
improve the photometric redshift accuracies while main- 
taining a large sample of galaxies. For the first time, we 
are able to determine the color scatter of galaxies on the 
red sequence for a complete, mass-selected sample with 
accurate photometric redshifts. 

In this paper, we study the properties of a mass- 
selected sample of galaxies from the NMBS over four 
redshift intervals, focusing on the properties of the qui- 
escent galaxies residing on the red sequence. Here, we 
will use the term "quiescent" to signify old stellar pop- 
ulations with red rest-frame colors (U- V >1A) that are 
not vi gorously forming stars. Following iBrammer et all 
(|2QQ9h . we correct the rest-frame U-V colors for dust red- 
dening allowing a clean separation of the red and blue se- 
quences. As expected, the star formation rates of these 
"quiescent" galaxies from SED modeling are typically 
low (the median SFR is 0.2 M yr _1 ), although some 
probably have ongoing star formation up to the level of 
~10 M yr _1 . Probing the internal scatter for the first 
time at £>1, we show trends of U-V color with the rel- 
ative ages of the stellar populations and the composite 
rest-frame spectral energy distributions (SEDs) of all qui- 
escent galaxies. We show that the results would be simi- 
l ar if w e used the UVJ color selection of I Williams et al.l 
()2009[ ). Finally, we place constraints on the star forma- 
tion histories for passive evolution given the observed 
intrinsic scatter as a function of redshift. 

We assume a ACDM cosmology with =0.3, 
^a=0.7, and Ho=70 km s _1 Mpc -1 throughout the pa- 
per. All magnitudes are given in the AB system. 

2. DATA 

The NEWFIRM Medium-Band Survey (NMBS) em- 
ploys a new technique of using medium-bandwidth near- 
IR filters to sample the Balmer/4000A break from 1.5 < 



z < 3.5 at a higher r esolution than the standa rd broad- 
band near-IR filters (van Dokkum et "all 12009). thereby 
improving the accuracy of the photometric redshifts. We 
briefly summarize the survey here, the full details of the 
reduction, source detection, and generation of the pho- 
tometric catalogs will be described in K.E. Whitaker et 
al. (in prep). The NMBS survey is based on a similar 
concept to the optic al medium-band filters used in the 
COMBO-17 survey ([Wolf et al.ll2003l ). A custom set of 
five medium bandwidth filters in the wavelength range 
of 1-1.7/im were manufactured for the NEWFIRM cam- 
era on the Kitt Peak 4m telescope for the survey. Data 
were taken over the 2008A, 2008B, and 2009A semesters, 
with a total of 75 nights (45 nights through the NOAO 
Survey Program and an additional 30 nights through a 
Yale-NOAO time trade). 

The survey targets two ~0.25 deg 2 fields within the 
Cosmological Evolution Survey (COSMOS) and All- 
wavelength Extended Groth Strip International Survey 
(AEGIS), chosen to take advantage of the wealth of 
publically- available ancillary data over a broad wave- 
length range. We combine our five medium-band near- 
IR images and a broad-band K-band image taken with 
the NEWFIRM camera with optical images in the ugriz 
broad-band filters of both survey fields, made publi- 
cally available through the CFHT Leg acy Surve;fl using 
images reduced by the CARS team (jErben et al.l 12009c 
iHildebrandt et aLl l2QQ9h . Additionally, we include deep 
Subaru images with Xhe BjVjr + i + z + broa d-band filters 
in the COSMOS field (jCapak et aLll2QQ7h . Finally, we 
include IR images in the Spitzer-IRAC bands over the 
entire COSMOS pointing that are pr ovided by the S- 
COSMOS survey ([Sanders et al.ll2007D . and partial cov- 
erage of the AEGIS pointing ( ^0.15 deg 2 ) overlapp ing 
with the Extended Groth Strip (jBarmbv et all 12008( 1. 

In this study, we use a if-selected catalog generated 
from the 2008A and 2008B semesters only. The AEGIS 
catalog contains 15 filters and the COSMOS catalog con- 
tains 21 filters (u-S/im). The optical and near-IR images 
were convolved to the same point-spread function (PSF) 
before performing aperture photometry, so as to limit any 
bandpass-depende nt effects. The photome try was done 
with SExtractor ( Bertin fc Arnoutsl fl996) in relatively 
small color apertures chosen to optimize the signal-to- 
noise. We determine an additional aperture correction 
from the if -band image that accounts for flux that falls 
outside of the AUTO aperture, the reby enabling us t o 
calculate total magnitudes (see, e.g. lLabbe et al.l l2003h . 
We use an alternative source fitting algorithm especially 
suited for heavily confused images for which a higher res- 
olution prior (in this case, the if -band image) is avail- 
able to extract the photometry from the IRAC imaged. 
T his method is described in more detail in the appendix 
of iMarchesini et al.l ([2009D . 

Using th e EAZY photometric redshift and rest-frame 
color code ([Brammer et al.ll2008[ ). we find the photomet- 
ric redshifts in COSMOS to be in excellent agreement 

9 lhttp://www.cfht.hawaii.edu/Science/CFHTLS/| 
10 Through visual inspections of the residual IRAC images (after 
subtracting the modeled K-detected objects), we estimate that flux 
contributions from undetected K-band sources may influence the 
IRAC photometry for less than 5% of the entire quiescent sample 
studied in this work at a level greater than the formal error bars. 
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with the spectroscopic redshifts mad e publically avail- 
able through the ^COSMOS survey ([Lilly et alJl2007l ). 
with a z /(l-\-z) =0.016 for 632 objects at z spec <1. We 
also find excellent agreement between the photomet- 
ric and spectroscopic redshifts for a larger sample of 
2313 ob jects at z m *r. <l- 5 in AEGIS from the DEEP2 
survey ([Davis et al.ll2QQ3h with a z /(l+z)=0.017. Both 
fields have very few catastrophic failures, with only 
3% >5<j outliers. Spectroscopic redshifts also exist for 
125 Lyman Brea k Galaxies (LBGs) at z ~3 within the 
AEGIS field from lSteidel eTaD (j2003h . for which we find 
a z /(l+z) =0.045, with 10% >5a outliers. However, we 
note that LBGs are very faint in the rest-frame op- 
tical (observer's near-IR) and their spectra have rela- 
tively weak Balmer/4000A breaks. There is excellent 
agreement between the NMBS phot ometric redsh i fts an d 
the Gemini/GNIRS redshifts from iKriek et al.l (|2006h . 
with a biweight scatter in (z phot - ^s P ec)/(l+^s P ec) of 
only 0.010, albeit this is only for four galaxies (see 
Ivan Dokkum et aTll2009[ ). The photometric redshift ac- 
curacies range from a z /(l-\-z)~ 0.01 for galaxies with 
stronger Balmer/4000A breaks to a z /(l+z)~ 0.05 for 
galaxies with less defined breaks. The lack of spectro- 
scopic redshifts above z ~ 1 highlights the necessity 
for both follow-up spectroscopy at high redshift, as well 
as the innovation of new techniques to accurately di- 
agnose the reliability of the photometric redshifts (see 
iQuadri fc Williams! 120091 ). 

From the best-fit EAZY template, we compute the 
rest-frame U-V colors following the method used by 
iWolf et all (|2003D in the COMBO-17 survey. We measure 
the rest frame U-V color from the best-fit temp late, us- 
ing the filter definitions of Maiz Apellaniz (2006). When 
using closely-spaced medium-band observed filters, the 
template fluxes are found to be more robust than inter- 
polating between observed filters (see Brammer et al. in 
prep). The resulting U-V colors have average uncertain- 
ties of ^0.058±0.008 mag for the entire sample of massive 
(>1O 11 M0) galaxies, as determined from both systematic 
uncertainties and Monte Carlo simulations (described in 
detail in §3.2). 

We fit the photometry w ith stellar populat ion synthe- 
sis templates using FAST (jKriek et al.ll2009l h fixing the 
redshift to the EAZY output (or the spectroscopic red- 
shift where available), and determine the best-fit age, 
dust extinction, star formation timescale, stellar mass, 
and star formation rate. The models input to F AST are 
a grid of lMarastonl (|2005f ) models that assume a iKroupal 
(|2001f ) IMF with solar metallicity and a range of ages 
(7.6-10.1 Gyrs), exponentially declining star formation 
histories (7 < r < 10 in log years) and dust extinction 
(0 < Ay < 4). The dust content is par ameterized by 
the ex tinction in the V-band following the iCalzetti et al.l 
(|2000D extinction la^Q 

In this paper, we select massive (>10 11 M Q ) galaxies in 
four redshift bins from the NMBS that sample roughly 
equal co-moving volumes (with the exception of the low- 
est redshift bin). Within this sample of massive galaxies, 
we are interested in understanding galaxies that are on 
the red sequence because they have old, evolved stellar 

11 We note that differe nt dust attenuation laws have little effect 
on quiescent galaxies (see Muzzin et al. 2009). 



populations. At high z, where we currently lack morpho- 
logical information, we must devise a method for selec- 
tion of dusty, star-forming galaxies masquerading as red, 
quiescent galaxies. 

3. THE AGE SPREAD OF QUIESCENT GALAXIES 
3.1. Selecting Quiescent Galaxies 

The color-mass relation is shown in the top panels of 
Figure 1. Clearly, the majority of massive galaxies with 
masses >10 n M are red at all redshifts to z=2. How- 
ever, a significant fraction of these galaxies are red be- 
cause of dust, not because they have old, evolved stel- 
lar populations (e.g. iWvder et al.ll2007t iCowie fc Bargerl 
120081 : iBrammer et all 120091). 

Following iBrammer et al.l (|2009f ). we select quiescent 
galaxies by requiring, 

(U - Vy - A uv - (2.03 - 0.77 • t LB /t H ) > -0.4, (1) 

where (U-V)' is the rest-frame U-V color with the slope 
of the color-magnit ude relation remov ed using the non- 
evolving slope from lBell et al.l (|2004bl ) and Auv is dust 
reddening correction fact or of 0.47 -Ay (the vis ual extinc- 
tion) as derived using the ICalzetti et al.l ([20001 ) exinction 
law. The final term accounts for a mean linear color evo- 
lution with time, where £lb is the lookback time and £r 
is the Hubble time. The average selection given by Equa- 
tion 1 corresponds to the dashed horizontal line at each 
redshift interval. The selection method removes dusty, 
star forming galaxies based on their extinction thereby 
leaving us with the red sequence. This selection is simi- 
lar to and more co nserv ative than the UVJ se lection of 
lLabbe et all (|2005f ) and iWilliams etHI (|2009l ) (see Ap- 
pendix A). 

Comparing the observed U-V color to the color with 
an additional dust-correction Auv (top and bottom pan- 
els in Figure 1 , respectively) , we find that not all massive 
red galaxies are old, as there is a significant population 
of dusty and star for ming massive galaxies at z >1 (see 
IBrammer et a l. 2009). Those star forming galaxies that 
are dusty interlopers are marked as green stars in the 
bottom panels of Figure 1 and removed from the sample 
through the above selection method. Additionally, those 
galaxies that have MIPS detections (>20/iJy) are indi- 
cated with small red circles, but are not removed from the 
sampl43; these objects likely host active g alactic nuclei 
(AGN ) or may be forming stars (see also iDaddi et al.l 
120071 ). We note that the scatter measured in U-V de- 
creases by ~ 0.03 mag at z ~ 0.5 and ^0.01-0.02 at 
higher redshifts when removing all MlPS-detected qui- 
escent galaxies. The extinction correction effectively re- 
duces the number of galaxies in the transition zone be- 
tween the blue cloud and red sequence, revealing distinct 
populations with bimodal intrinsic colors up to at least 
z~2. 

This selection method is enabled by the increased res- 
olution provided by the near-IR medium-bandwidth fil- 
ters. The improved sampling of the SEDs enable ro- 
bust constraints on both the photometric redshifts (~2% 

12 24 /im fluxes from Spitzer- MIPS images provided by the S- 
COSMOS and FIDEL surveys, see lBrammer et al.l (120091 ) for a full 
description. 
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log MyM log M^/M log MyM log M^/M 

Fig. 1. — The rest-frame U-V color with the slope of the CMR removed using a non-evolving slope from Bell et al. (2004b) (top panels) 
as well as a correction for the dust-content (bottom panels) as a function of stellar mass. Those quiescent galaxies with stellar masses 
>10 n Mq on the red sequence are black, filled circles, and the dusty galaxies with obscured star formation are green stars. Small red 
circles indicate galaxies with MIPS detections >20//Jy. The vertical dashed lines identify the lower limit of the mass-selected sample and 
the horizontal dashed lines are the mean selection limits in each redshift bin. The grey lines in the top panels are the 90% completeness 
levels for Ktot >22.8 ABmag at the maximum redshift of the bin. The completeness lines for z<lA fall off the plot, with masses <10 10-2 
Mq. Nearly all massive galaxies are red (top panel), but some of these galaxies have red U-V colors due to dust reddening from obscured 
star formation. When we correct the U-V colors for the amount of dust (from SED modeling), we see a bimodal color distribution at all 
redshifts. 



accuracies compared to the —6-7% accuracies of broad- 
band %>hot when using all available spectroscopic red- 
shifts) as we ll as the dust conten t of the stellar pop- 
ulations (see Br ammer et al.l [2QQ9h . The typical upper 
and lower 68% confidence intervals on the dust extinc- 
tion (for both quiescent and star forming galaxies) range 
from at z — 0.5 to ^o'i at z — 2. Furthermore, the 
inclusion of IRAC data helps constrain the overall shape 
of the SED and significantly improves the confidence in- 
terv als of the dust extinction for the individual galaxies 
fe.g JLabbe et all 120051 : iWuvts et all 120071 : iMuzzin et all 
120091 ). About 20% of the quiescent galaxies have no 
IRAC coverage as we did not make any restrictions in 
our selection method, which may imply tha t their classi- 
ficati on as quiescent is less robust (see, e.g.. lLabbe et al.l 
|2005[ ). We verified that these 20% of objects do not devi- 
ate systematically from the trends we see for the sample 
of objects with IRAC coverage. We note that this selec- 
tion method is consistent with the UV J-selection, which 
does not depend on models (see Appendix A.). 

In Figure 2, we show two example SEDs from the high- 
est redshift bin normalized at 1/im to highlight both the 
improved sampling of the SED as well as the fundamen- 
tal differences between the SEDs of the quiescent and 
dusty galaxies. Although both galaxies have red ob- 
served U-V colors, we see clear distinctions in the con- 
tinuum shapes of the SEDs. The dusty galaxy (right 
panel) has a more gradual slope than the instrinsically 
red, older stellar population (left panel), which shows a 
well-defined 4000 A break with a peaked SED. With the 



increased resolution of the medium-band data, we are 
able to distinguish the continuum shapes and properties 
of galaxies residing on the red sequence to remove dusty 
contaminants such as the galaxy in the right panel. 

The issue of the ongoing star-formation in these galax- 
ies is ambiguous, as this depends on the definition of 
a "quiescent" galaxy and is very model-dependent: for 
an individual quiescent galaxy the la range of specific 
star formation rates (sSFR) from the SED modeling de- 
scribed i n §2 is on average to 0.004 Gyr -1 out to z=2 
(see also iKriek et al.l [2QQ8h . Degeneracies between the 
age, dust, and mass in models complicate matters when 
selecting "quiescent" galaxies based on their sSFRs. The 
observational selection of quies cent galaxies i n this w ork 
is less model dependent than iRomeo et al.l ([20081 ) for 
example, who select quiescent galaxies by sSFR <0.01 
Gyr -1 from their simulations. About —35% of our qui- 
escent galaxies at 1.8 < z < 2.2 appear to have sSFRs 
>0.01 Gyr -1 , with a maximum sSFR of —0.06 Gyr -1 (as 
determined from SED modeling). This fraction decreases 
to -14% by 0.2 < z < 0.9, with a maximum sSFR of 0.02 
Gyr -1 . A sample of quiescent galaxies selected solely 
on best-fit sSFR would be biased against including the 
blue tail of the distribution of galaxies that may be the 
most recent additions to the red sequence. In Appendix 
A, we repeat key parts of the analysis using an alter- 
na tive selection of qu iesce nt galaxies, the UVJ m ethod 
of lLabbe et all (|2005l ) and iWilliams et al.l (|2009D . This 
method naturally isolates a distinct sequence of "red and 
dead" galaxies in the rest-frame U-V versus V-J dia- 
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Fig. 2. — Sample SEDs normalized to 1/im from the highest redshift bin, showing the effects of increased dust content for galaxies with 
red observed U-V colors. The red points are the medium-band filters in addition to the full data (black points), with the best-fit 

FAST template in grey. The two galaxies have similar U-V colors but different amounts of dust reddening; the quiescent galaxy in the 
left panel has a strong 4000 A break and a peaked SED, while the dusty galaxy in the right panel has a much more gradual slope in the 
Balmer-break region, rising to a peak at ~l/im. 



gram. We find that the results remain essentially un- 
changed when this selection is used, but note that the 
UVJ selection includes a slightly higher number of blue 
galaxies at high redshift. 

3.2. Deriving the Intrinsic Scatter in Quiescent 
Galaxies 

In Figure 03 we show that the observed scatter in the 
U-V color (green) increases with redshift for this sample 
of massive, quiescent galaxies. We note that the scatter 
is measured from the actual colors of the quiescent sam- 
ple, not the dust-corrected colors shown in Figure 1, as 
there are degeneracies in the models that limit the level 
of accurarcy of Ay . To interpret this increase we need to 
correct for the scatter introduced by photometric errors, 
thereby measuring the intrinsic scatter. For each quies- 
cent galaxy, the observed w-8/im fluxes were each per- 
turbed by a normally-distributed, pseudo-random num- 
ber from a Gaussian distribution with a mean of zero 
and a standard deviation of the photometric error for 
each respective filter. From these perturbed flux values, 
we generate 50 simulated catalogs and use EAZY to re- 
determine U-V, refitting the photometric redshifts. We 
take the biweight sigma of the U-V distribution for each 
galaxy to signify the uncertainty in the color. The av- 
erage uncertainty in the colors due to photometric error 
for all massive (>10 11 M ) galaxies ranges from 0.008 
mag at z=0.5 to 0.05 mag at z=20 

We calculated the contribution of the photometric er- 
ror to our measurement in the scatter of U-V by creating 
a sample of mock galaxies with an intrinsic scatter of (in 
other words, all galaxies have the same U-V color). Each 
mock galaxy is assigned a value for the scatter in U-V 
that comes from our simulations as described above, and 
we then perturb the colors by an amount that is drawn 
from a Gaussian distribution with width equal to the 
color uncertainty. The scatter of colors in the resulting 

13 In Appendix B, we address the possible concern that our 
photometric errors have been greatly underestimated. 



sample is then taken to be the contribution of photo- 
metric errors to the overall observed scatter. Finally, in 
order to reduce random uncertainties, we repeat this pro- 
cedure 100 times. The contribution to the observed U-V 
scatter from photometric error increases with z, as one 
might expect, from 0.007±0.005 mag at 0.2 < z < 0.9 up 
to 0.05±0.02 mag at 1.8 < z < 2.2, but it is significantly 
less than the observed color scatter at all redshifts. We 
note that the uncertainties in the U-V colors are smaller 
than the observed flux error bars that sample the rest- 
frame U and V regions. The rest-frame U and V fluxes 
are typically calculated from 2-3 medium band filters, 
which reduces the uncertainties. 

The errors in the photometry are very small at low 
redshift; therefore any perturbation of the fluxes does 
not significantly change the photometric redshift and the 
U-V color. There are however systematic uncertainties 
that will contribute to the measured scatter. To assess 
the effects of systematic errors, we use the 75 quiescent 
galaxies at 0.2 < z < 0.9 that have a spectroscopic red- 
shift. We compared the U-V colors measured for these 
quiescent galaxies from the photometric redshifts to the 
color measured using the spectroscopic redshifts. The 
error in redshift is strongly correlated with the result- 
ing error in color, where an under-estimate in z p hot of 
0.1, leads to a U-V color that is ~0.2 mag redder (see 
Appendix C). The normalized median absolute deviation 
of (U-V) spec -(U-V) p hot is 0.05 magnitudes. We conser- 
vatively assume that the contribution to the measured 
scatter due to systematic uncertainties is 0.05 mag at all 
redshifts, but note that this may decrease at higher red- 
shifts as our medium-band filters sample the rest-frame 
U and V filters at z >1. The scatter in the color due 
to measurement error is taken to be the systematic and 
photometric errors added in quadrature (blue points in 
Figure [3]). 

To measure the internal scatter in U-V in Figure [3] 
(black), we subtract the scatter due to measurement er- 
ror (blue) in quadrature from the observed scatter. The 
vertical black error bars are the uncertainties in the me- 
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dian value of the photometric scatter and the horizontal 
black error bars are the bin size. The grey filled region 
indicates the range of <Ju-v values we measure when 
we raise and lower the horizontal limit in Figure 1 by 
0.2 magnitudes, thereby changing the number of quies- 
cent galaxies selected. Regardless of the selection limit, 
we find an increasing intrinsic scatter with redshift. We 
compa re our measurements to work done bv lBower et al.l 
(j!992[ ) in the Coma Cluster, as well as a study of the 
massive (Md yn >10 n M^) early- type galaxies in 7 clus- 
ters from z=0.18 to 0.84 ( Ivan Dokkum 2008). We find 
that the scatter at z=0.5 is similar to th e intrinsic scat- 
ter of field galaxies recently measured by iRuhland et al.l 
(|20Q9[P^L but a factor of 2 higher than that measured 
previously in clusters at these redshift s. This may reflect 
a systematic difference between field and cluster galax- 
ies or some unrecognizable error contribution within our 
data. We note that if we limit the quiescent sample 
to the 75 galaxies with a spectroscopic redshift between 
0.2 < z < 0.9, we find a very similar ( somew hat larger) 
scatter. Ivan Dokkum fc van der M arel (2003) found that 
the massive red sequence galaxies in clusters are typically 
~0.4±0.2 Gyr older than field galaxies. If quenching in 
the cluster environment occurred 0.4 Gyr before quench- 
ing in the field, our models (described in §5) suggest that 
the intrinsic scatters should be lower by only ~0.02 mag, 
rather than the ~0.05 mag difference shown in Figure [31 
From Figure [3j we see that the intrinsic scatter in the 
colors of quiescent galaxies increases out to z ~ 1 and 
likely continues to increase to z~2, although there may 
exist selection biases in this regime. It seems likely that 
th e trend at 1 < z < 2 is real given the recent results 
of iHilton et al.l (j2009[ ). who measure an intrinsic zgso- 
J scatter (close to the rest-frame U-V) of 0.123±0.049 
magni tudes for a cluster at z = 1.46, and lPapovich et all 
(|2010D . who find cttj-b=0.136±0.024 magnitudes for a 
cluster at z = 1.62. Furthermore, the color scatter is 
in qualitative agreement with the predictions of simple 
passive evolution models that the scatter in color should 
decrease as the galaxies evolve. 

3.3. Origin of the Intrinsic Scatter in Quiescent 
Galaxies 

To understand the origin of the intrinsic scatter in U- 
V , we consider the properties of these stellar populations 
through spectral synthesis modeling. The scatter in the 
red sequence is thought to be determined by the scatter 
in both the age and metallicity, where the scatter in rel- 
ative ages is generally thought to be the dom inant driver 
(e.g. iBower et al.lll992l : iGallazzi et al.ll2006f ). The range 
of metallicities observed for local red sequence ellipticals 
from SDSS DR2 is -0.8-1.6 Z© from the faintest to the 
brightest gal axies, with a q-r colo r scatter of —0. 04-0. 05 
magnitudes (|Gallazzi et al.ll2006h . In this work, we fix 
the metallicity to Z , but we cannot exclude that metal- 
licity variations contribute to the scatter. We test the 
effects of metallicity on the scatter in U-V color in Ap- 
pendix E, finding a weak and opposite trend for the color 
scatter due to metallicity variations with redshift. Given 
the observed scatter in Figure [3j it is unlikely that metal- 
licity has a large effect on this work. 
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Fig. 3. — The observed (green) and intrinsic (black) scatter in 
the rest-frame U-V colors of the most massive, quiescent galaxies 
selected by their extinction-correct U-V colors. The scatter due to 
measurement errors (blue) is subtracted from the observed scatter 
in quadrature. The vertical black error bars mark the 68% confi- 
dence intervals due to photometric errors and the grey filled region 
indicates how crjj-V changes when we raise and lower the horizon- 
tal selection limit in Figure 1 by ±0.2 magnitudes. The dashed 
line is the expected evolution of the intrinic scatter due to passive 
evolution for galaxies that started forming stars at z=3 with a char- 
acteristic timescale for transformation into an early-type galaxy of 
1.4 Gyrs, with a large burst of star formation before transforming. 
The observed U-V data points used to calculate teh color scatter 
as a function of redshift are plotted in the bottom panel. We find 
that the scatter in U-V increases towards higher redshift, where the 
intrinsic scatter is significantly higher than the scatter introduced 
by photometric and systematic errors at z~2. 



The age as defined in stellar populations models 
strongly depends on the choice of star formation his- 
tory. To simplify, we assume an expontentially decaying 
star formation history with r=0.1 Gyr and < Ay < 
3. Using the ste llar popu lation synthesis code FAST 
(jKriek et all2009D with the iMarastonl (|2QQ5[ ) models and 
a lKroupal (|2001f ) IMF, we fit for the age of these stellar 
populations given the above constraints. Given our as- 
sumptions, we effectively measure the relative ages (and 
dust) of the quiescent galaxies, fixing all other parame- 
ters. 

Absolute ages may not be very meaningful given our 
modeling approach, but the relative ages of blue and red 
galaxies are robust if we assume that they have similar 
star formation histories. In Figure HJ we plot the best-fit 
age of all quiescent galaxies as a function of their rest- 
frame U-V color. The reddest galaxies have systemati- 
cally older stellar populations than the bluest galaxies at 
all redshifts. Given the assumption of solar metallicity, 
there exist a few quiescent galaxies consistent with the 
age of the universe (dotted horizontal lines) at z>l. We 
include the typical 68% confidence intervals in the ages 
in Figure |4] from the modeling described in §2, rather 
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than the modeling in §3.3, as the average uncertainty is 
much smaller than what we expect given our assump- 
tions in the modeling due to degeneracies between age 
and metallicity. Additionally, we plot the typical random 
and systematic errors in the U-V color as a function of 
redshift in Figure HJ At z<1.5, the red and blue quar- 
tiles have the same low-level of dust reddening, whereas 
at z >1.5 the red quiescent galaxies have ~0.2 magni- 
tudes more extinction. 

3.4. Rest-Frame SEDs of Quiescent Galaxies 

We note that the general trends in Figure |4] do not 
prove that we are measuring age differences, as bluer U- 
V colors will force a lower age in the fit, even if the range 
in U-V is due to photometric scatter only (see Appendix 
B for more details on the effects of photometric scat- 
ter). In the previous section we just considered the U-V 
colors, we will now investigate the shape of the full ob- 
served rest-frame SEDs of these galaxies to see if there is 
a fundamental difference between the reddest and bluest 
galaxies on the red sequence. 

Stellar populations with ages of ~0.8 Gyr will have 
strong Balmer breaks at 3646A, whereas older popula- 
tions will have pronounced 4000 A breaks. Although the 
differences will be subtle, the SEDs of galaxies with ages 
of ~0.8-l Gyr should be distinct from older stellar popu- 
lations. This has not been observed previously at high z 
because the J, H, and K broad-band filters smooth over 
these spectral features. Figure[5]demonstrates the impor- 
tance of the spectral resolution by comparing the com- 
posite SED of the reddest quartile at 0.9 < z < 1.4 (light 
grey points) to models smoothed to the resolution of the 
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Fig. 4. — These panels contain the ages of quiescent galaxies (as 
derived from spectral synthesis models in §3.2) as a function of 
rest-frame U-V color, in four redshift bins. The reddest and bluest 
U-V quartiles are denoted by the colors red and blue, respectively, 
while the grey points are the middle quartiles. The dotted line 
signifies the maximum age of the universe within each redshift bin. 
The bottom panels contain histograms of colors, where the red and 
blue dotted lines in the bottom panels are the median colors of the 
two quartiles and the black, dashed line is the median color over the 
entire redshift interval. The general trend at all redshifts is that 
redder galaxies are typically older than bluer quiescent galaxies. 
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Fig. 5. — The effect of resolution can be seen in this figure, where 
the thin black line is the average best-fit model to the rest-frame 
composite SED from the medium-band measurements of 46 red- 
dest galaxies over the redshift range 0.9 < z < 1.4 (also shown 
in Figure [6}- If this model is then smoothed with a Gaussian 
with a width equal to the resolution of the medium-band filters, 
0.15/im/(l+z), we see that the smoothed model at medium-band 
resolution (thick black line) is smoother than the model at original 
resolution, but still follows the shape of the Balmer/4000A-break 
region closely. Note that we mark the location of the 4000A-break 
with an arrow. We then smooth the models with a Gaussian that 
has a width equal to the typical resolution of broad-band filters, 
0.4/xm/(l+2) (dashed line). The Balmer/4000A-break has become 
washed out due to the poor resolution. 



medium- and broad-band filters. For the first time, we 
are able to distinguish these features with the NMBS due 
to the increased sampling of the Balmer/4000A break re- 
gion by the medium-band filters and the accurate pho- 
tometric redshifts. We therefore consider the rest-frame 
SEDs of these quiescent galaxies to understand if the in- 
trinsic scatter we measure is really due to the ages of 
the galaxies. Specifically, we are interested in the SEDs 
of the reddest and bluest quartiles (red and blue, respec- 
tively, in Figure HJ. If the scatter is due to age as implied 
from the spectral synthesis modeling, we should see an 
increasing dichotomy in the SEDs of these quartiles as we 
look to higher z, where we observe an increasing internal 
scatter in color. 

With the improved resolution and photometric redshift 
accuracies of the medium-band filters, we are able to 
measure the relative ages of a complete sample of massive 
galaxies inhabiting the red sequence out to z ~2 for the 
first time. The rest-frame SEDs of these massive galaxies 
are plotted in Figure [6j including all individual observed 
fluxes from u-8fim shifted to the rest-frame for all qui- 
escent galaxies, normalized at 7000A. We consider the 
SEDs of the reddest and bluest quartiles in U-V (shown 
in the inset U-V histograms), where the observed fluxes 
of the galaxies with the reddest colors are indicated with 
red points and the bluest galaxies with blue points. The 
median best-fit spectral synthesis model templates for 
the reddest and bluest quartiles are the solid lines in Fig- 
ure El using the FAST settings as described in §3.3. The 
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Fig. 6. — The composite rest-frame SED of massive, quiescent galaxies from z=0.2 to 2.2 in four redshift bins (top to bottom), with the 
reddest and bluest U-V quartiles selected in Figure |J] color-coded the same. From left to right, we see the reddest composite SED, the 
bluest composite SED and finally the composite SED of all quiescent galaxies in grey with the two quartiles overplotted in the right panels. 
The solid lines are the median best-fit templates as derived from the spectral modeling analysis in §3.2, smoothed with a Gaussian with a 
width equal to the average rest-frame medium-band filter resolution of 0.15/im/(l+z). The redshift range is labeled in the top left for each 
bin, and the total number of galaxies in each panel is indicated in the top, left corner. The sub-panels in each redshift range on the right 
are the histograms of {U-V)\ U-V with the slope of the color-magnitude relation removed. The spectral shapes of the reddest and bluest 
quartiles are very similar at low redshift. At high redshift, there exist old quiescent galaxies (red) and young quiescent galaxies (blue) that 
leads to a relatively large observed intrinsic scatter in U-V . 
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models are shown in Figure [6] with Gaussian smoothing 
by the average rest-frame distance between the medium- 
band filters of ~ 0.15/im/(l+z). Notice that there is 
more smoothing at lower redshift (due to the dependence 
of the resolution on the scale factor) and we therefore 
expect to resolve the Balmer/4000A-break region with 
higher resolution at high- z because the medium-band fil- 
ters span a shorter wavelength range. 

We see an increasing dichotomy between the SEDs 
of the reddest, oldest galaxies and the bluest, youngest 
galaxies relative to 7000A with the increasing intrinsic 
scatter in color. The SEDs are not only different in U- 
V, which is how they are selected (and also template 
dependent), rather at all rest-frame wavelengths. Fur- 
thermore, we plot all observed fluxes for our sample of 
galaxies in the right panels of Figure [6] to demonstrate 
the robustness and fundamental differences between the 
reddest and bluest galaxies on the red sequence. The re- 
maining 50% of the sample that have intermediate U-V 
colors (grey points) bridge the gap between the SEDs of 
the reddest and bluest quartiles at z~2. With the com- 
posite rest-frame SEDs, we sample the Balmer/4000A 
break region with sufficient resolution to distinguish be- 
tween a Balmer- or 4000A-break. We note that the mod- 
els do not fit the lowest redshift galaxies very well, tend- 
ing to force slightly larger differences between the quad- 
rants at 0.4 < A r est < 0.6/im than observed. As a test, 
we have included different metallicities in the modeling 
described in §3.3, and we find that they reduce the dis- 
crepency between the best-fit model and data points seen 
for the reddest galaxies at z < 1.5 slightly, but not en- 
tirely. 

In the high redshift bins, we see that the bluest, 
youngest galaxies emit more radiation than the reddest 
galaxies at all wavelengths < 7000 A. In particular, the 
region around 0.4-0.5/im and <0.3/mi (blueward of the 
U -band) show clear differences between the red and blue 
quiescent galaxies. These galaxies that populate the blue 
tail of the distribution have younger ages and bluer col- 
ors at higher redshifts. Focusing on the rest-frame UV 
wavelength regime, we see a factor of >2 increase in the 
continuum emission for the blue quartile with increas- 
ing redshift. Additionally, we see the flux in the Balmer 
break region grow in strength with increasing redshift. 

While the SEDs of the bluest galaxies undergo signif- 
icant evolution from z=0.2 to 2.2, the oldest galaxies 
remain virtually unchanged. The old galaxies inhabit 
a very small range of U-V colors with uniform spectral 
shapes at all redshifts. The rest-frame UV continuum 
is very faint and the galaxies have strong (smoothed) 
4000A breaks. We conclude that the increase in the U-V 
scatter is driven by an increase in the dispersion of spec- 
tral types; there is a significant population of younger, 
blue quiescent galaxies at z ~2 that are nearly absent 
by z ~0.5. The observed rest-frame SEDs confirm that 
the major driver of the intrinsic scatter in color of the 
red sequence is the scatter in age, as the stellar pop- 
ulations originate from varying epochs in the universe. 
In Appendix B, we show that our results are robust 
against severe underestimates of the photometric errors, 
and against large systematic redshift errors. Addition- 
ally, we test the effects of dust and ongoing star forma- 
tion on the composite SEDs in Appendix D. 



4. THE REDDEST GALAXIES AT Z~2 

iKriek et all (|2006D (hereafter K06) were the first to 
show that many massive high- z galaxies have no (strong) 
emission lines and have strong Balmer/4000A breaks 
within their spectroscopic sample of 9 quiescent galax- 
ies. The K06 sample of quiescent galaxies had relatively 
young ages of 0.5-1 Gyr. Perhaps the most striking fea- 
ture of Figures [4] & [6] is that we see evidence for very 
old galaxies at z ~2 that seem to have more evolved stel- 
lar populations than the K06 sample, with higher M/L 
ratios. 

Selection effects may explain why the K06 did not 
find the more evolved galaxies we observe with the 
NMBS. The K06 NIR spectroscopic survey selected tar- 
gets from the MUSYC if-selected pho tometric catalog 
([Gawiser et a l. 2006; Quadri et al.ll2007[ ), choosing galax- 
ies with photometric redshifts of z ~2.3. The targets were 
selected to a limiting magnitude of 21.6 in AB magni- 
tudes to ensure an adequate signal-to-noise ratio for the 
NIR spectra. 

In order to illustrate the difference between a 
magnitude-limited sample and a mass-limited sample, in 
Figure [71 we show the binned U-V colors and ages (as 
derived in §3.3) as a function of the total if -band mag- 
nitude for all quiescent galaxies with photometric red- 
shifts between 2=1.8 and 2.2. Brighter galaxies tend to 
have bluer colors and younger ages. We also note that we 
are observing an inversion of the color-magnitude rela- 
tion. In Figure [7J the dashed line is the median if -band 
magnitude of the 9 spectroscopically-confirmed quiescent 
galaxies from K06 and the dotted line is the limiting mag- 
nitude of the spectroscopic survey. Both limits have been 
increased by ~0.4 magnitudes to account for the differ- 
ence in median redshift: z~2.3 for K06 and z~2 for our 
sample (assuming no evolution). The magnitude-limited 
sample would not include galaxies that lie in the hashed 
region, thereby biasing the sample against the reddest 
and oldest galaxies. 

In §3.4, we showed that the spectral shapes of the red- 
dest and bluest galaxies on the red sequence are funda- 
mentally different at z ~2. We compare the SEDs of 
the spectroscopically-confirmed quiescent galaxies from 
K06 to our mass-selected sample in Figure [51 The me- 
dian rest-frame spectra of the K06 sample of quiescent 
galaxies binned to low resolution (grey shaded region) 
and smoothed with a Gaussian to the medium-band res- 
olution matches the median SED of the bluest quiescent 
galaxies in our sample. 

We cannot exclude the possibility that the differences 
between the spectra arise because we are comparing the 
K06 sample to galaxies at a slightly lower redshift. Be- 
cause the galaxies in the spectroscopic sample will have 
aged by~0.4 Gyr between (z)=2.3 and 2, we fit the K06 
spectra with a r-model using the settings described in 
§3.3 finding a best-fit age of 0.8 Gyr, and age this best-fit 
model by +0.4 Gyr (black line in Figure [8]). The median, 
aged K06 spectra lies roughly between the two extremes 
of our sample, which implies that typical z=2 galaxies 
could be descendants of the z ~2.3 galaxies of K06. It 
will be interesting to see whether very old galaxies exist 
at z~2.5 and beyond. 

Figures [7] and [8] illustrate the differences between mass- 
limited and magnitude-limited samples. Although a sim- 
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Fig. 7. — The binned U-V color and ages of all quiescent galaxies 
as a function of their total X-band magnitudes. The dashed line is 
the median K-band magn itude of the 9 s pectroscopically-connrmed 
quiescent galaxies by (Kriek et al. 2006) and the dotted line is the 
limiting magnitude used by Kriek et al. to select their sample. The 
reddest and oldest galaxies in the hashed region would therefore 
not be included in the Kriek et al. magnitude-limited sample. 
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Fig. 8. — The composite rest-frame SED of the reddest and bluest 
U-V quartiles of massive, quiescent galaxies from 2=1.8 to 2.2 (red 
and blue points). The solid lines are the median best-fit tem- 
plates as derived from the spectral modeling analysis with Gaus- 
sian smoothing to the medium-band resolution of 0.15yum/(l+z), 
compared to the median binned spectra of 9 quiescent galaxies at 
z ~2.3 from IKriek et al.l (120061 ) (dot-dashed line, also smoothed 
to the medium-band resolution). The Kriek et al. sample match 
the spectral shape of the blue quartile. We also fit the Kriek et 
al. median spectra with a r-model (see §4) and aged this best-fit 
model of 0.8 Gyr by +0.4 Gyr to evolve the spectra to the average 
redshift of the medium-band data of z ~2 (black line). The me- 
dian, aged K06 spectra lies roughly between the extremes of our 
sample, which implies that typical z~2 quiescent galaxies may be 
the descendents of the K06 sample. 




pie magnitude limit is practical, one must be aware of 
the ways in which the limit can introduce biases into the 
galaxy sample. With our complete, mass-limited simple, 
we are able to quantify the build up of the massive end of 
the red sequence by using our observations to constrain 
models. 

5. BUILD UP OF THE RED SEQUENCE 

Given the observed evolution in the U-V scatter and 
the trends with age, we next look to simple models to ex- 
plain the build up of the red sequence. In Figure [9j we see 
that the fraction of massive galaxies with intrinsically red 
colors due to evolved stellar populations increases with 
time. At higher redshift, there is an increasing fraction 
of massive galaxies that are dusty and star forming, in- 
habiting the region close to the green valley in the CMR. 
About 90% of the most massive galaxies are quiescent 
at z<\. We note that our sample of massive galaxies is 
selected with a fixed mass limit of 10 11 M at all red- 
shifts and will therefore not account for mass evolution. 
To test how sensitive the fractions are to the adopted 
mass limit, we also select a sample of massive galaxies 
at a co nstant number density of n = 2xl0 -4 Mpc -3 fol- 
lowing |van~^okkum3iFi2 (|2010l ). The resulting fraction 
of quiescent and active galaxies is nearly identical to the 
fractions for a fixed mass limit. The uncertainty in the 
fraction of quiescent galaxies due mass evolution are in- 
dicated with the error bars in Figure We also show the 
range of fractions that result when the selection limit (in 
Figure 1) is varied by ±0.2 magnitudes (grey filled re- 
gion). 

Our results are consistent with many high redshift 
studies, which have found a significant f raction of qui- 
escen t galaxies in place at z >2 (e.g. iFontana et al.l 
120091 : 1 Williams et~atll2009[ ). We find that the most mas- 
sive objects at z=1.5-2 are divided roughly equally be- 
tween s tar forming and quies cent galaxies, the sa me re- 
sults as lWilliams et all (|2009D and IFontana et all ([2 009). 
Consistent with previous work (e.g. IKriek et al.l 120081 : 
iDamen et al.ll2009l : IFontana et al.ll20"09l ) , we find that the 
massive, quiescent galaxies observed at z ~2 assembled 
most of their stellar mass at higher redshift s. 

The observed evolution of the fraction of quiescent 
galaxies and the color scatter can constrain simple pas- 
sive evolution models with " progenitor bias"F^l. The Pas- 
sive Evolution CalculatoiH (jvan Dokkum fc Franx1l2001[ ) 
models the evolution with three parameters: t s tart? r stop 
and /*. The parameter t s tart is the time when star forma- 
tion starts and the parameter T st0 p is the characteristic 
scale for the probability distribution of the times when 
star formation stops. The variation of the star formation 
rate is accounted for with the dimensionless parameter 
/*; the star formation history can include bursts and in- 
creasing or decreasing continuous formation rates as a 
function of time. Once the star formation is quenched in 
galaxies, the luminosity and color evolution is well ap- 
proximated by a single-age population of stars with the 
same luminosity- weighted mean age (see van Dokkum & 

15 At high z, young early- type galaxies will drop out of samples 
creating a biased subset of the low z sample that contains only the 
oldest progenitors of today's early- types. This effect is known as 
"progenitor bias" and leads to an underestimate of the observed 
redshift evolution of the mean luminosity and colors of early- types. 

16 http:/ /www. astro. yale.edu/dokkum/old. index. html 
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Franx 2001). The parameter /* can range between and 
1; the limiting case of f* ~0 implies that most of the 
stars formed close to the start of star formation, 7*^0.5 
is a roughly constant star formation history and when 
/* the luminosity- weighted mean age of the stellar 
populations is dominated by a burst at the end of the 
star formation history. These models can be applied to 
observations of early- type galaxies, enabling fits to the 
early-type galaxy fraction, the mean color evolution and 
the scatter in the CMBEH 

Here, we use the Passive Evolution Calculator to pre- 
dict the redshift evolution of U-V, the rms scatter in 
U-V and the fraction of today's early- types that are in 
place by redshift z for early-type galaxies that started 
forming stars at z=3. We assume the timescale for trans- 
formation to quiescent galaxies is zero (in other words, 
the truncation of star formation is instantaneous) and 
vary the probability distribution of stop times for trans- 
formation from star forming galaxy to quiescent galaxy, 
T st0 p, and the form of the star formation history of the 
individual galaxies, /*. By matching the predicted in- 
trinsic scatter in U-V to the observed scatter, we find 
a best-fit characteristic timescale for transformation of 
7"stop= 1-4^0.5 Gyrs and a luminosity weighted formation 
time of the stars of /*=0.92t° °j?. The best-fit model 
broadly reproduces the observed scatter in Figure [3] and 
the early- type fraction in Figure [9j although the fit is not 
particulary good at high redshift. 

The resulting best-fit values result in low values of 
Tstop, which implies that the most massive early- type 
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Fig. 9. — The fraction of massive galaxies (>lO n M0) that are 
quiescent (black squares) and still star forming (green stars). Ad- 
ditionally, the quiescent galaxies are broken down by those old 
galaxies that have stellar populations >1.5 Gyr (red circles) and 
those that are younger with ages <1.5 Gyr (blue triangles), as de- 
rived from the SED modeling described in §2. The black dashed 
line is the expected evolution of the fraction of early-type galax- 
ies from passive evolution given the observed scatter in U-V. At 
z~1.5-2, roughly half of the most massive galaxies are quiescent. 

17 The parameter At in van Dokkum & Franx 2001 is set to 
zero, as we are not concerned with the time when a galaxy is mor- 
phologically identified as early-type. 



galaxies mainly formed at high redshift . This result is 
consistent with iMarchesini et al.l (j2009[ ) , who find that 
there is very little evolution in the number density of the 
most massive galaxies from to 1.5, pointing towards 
a high formation redshift for the most massive galaxies in 
the universe. Furthermore, the relatively high values of 
/* indicate that these galaxies experienced a burst of star 
formation close to the end of the period of star forma- 
tion. These results indicate that massive, dusty galaxies 
with high star formation rates may be the progenitors of 
the most massive quiescent early- type galaxies. Accord- 
ing to this simple model, about ~20% of the massive, 
dusty star forming galaxies at z ~2 have specific star for- 
mation rates >1 Gyr -1 . This resevoir of massive dusty 
star-forming galaxies will likely soon quench and migrate 
to the red sequence at lower redshift s. 

The evolution of the red sequence may involve more 
complicated processes beyond simple passive evolution. 
For example, we see that the observed fraction of qui- 
escent galaxies evolves more strongly beyond z ~1.5 in 
Figure [9j the same epoch where the color evolution seems 
to be inconsistent with passive evolution. The predicted 
evolution of the early-type fraction from passive evolu- 
tion (dashed line in Figure [9]) is less steep than the ob- 
served evolution at z>l.b. Us ing a very similar se l ection 
of galaxies from the NMBS, Ivan Dokkum et all (|2010D 
calculate that the star formation rate drops by a factor 
of 20 from z=2 to 2=1.1, whereas the stellar mass grows 
only by a factor of 1.4 in the same redshift interval. This 
rapid evolution implies that more complicated processes 
may govern the quenching of star formation. Addition- 
ally, once galaxies have moved to the red sequence there 
may still be other physical mechanisms acting aside from 
simple passive evolution. 

We plot the mean U-V color as a function of redshift 
in Figure [10] and compare this with the predictions from 
the models, which are forced to agree with the mean 
U-V color at z=0.5. It is interesting that there is no 
evolution in color between z=2 and 1.6, as this implies 
that this epoch is where the red sequence is most heavily 
supplemented with newly quenched galaxies. Although 
the cttj_v and early-type fractions appear consistent with 
passively evolving galaxies with a range of truncation 
times at z <1.5, the observed color evolution is clearly 
inconsistent. Even the most extreme case of the color 
evolution of a single stellar population (SSP) formed at 
a redshift of infinity is inconsistent with the z ~2 mean 
color. The grey filled region shows the selection effect of 
changing the color limit by ±0.2 magnitudes. Reducing 
the lower boundary of the U-V color selection will result 
in a bluer mean color at z ~2, but it cannot reconcile 
the difference between the data and the best-fit model 
from the color scatter (although the extreme case of a 
SSP formed at z = oo does agree for the lowest mag- 
nitude selection limit only). This discrepancy between 
the observed color evolution and the predictions from 
passive evolution has been noted in other studies (e.g. 
iKriek et~ al. 2008), and there are several mechanisms that 
may slow the color evolution of galaxies. Galaxies may be 
dustier at higher redshift, which will redden the intrinsic 
colors. A visual extinction of ~0.2-0.3 (which would be 
consistent with the observed SEDs) will redden the U-V 
color by ~0.1 mag, which is similar to the offset we see at 
z ^1-1.5 between our observed color and that predicted 
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Fig. 10. — The mean rest-frame color with the slope of the CMR 
removed, (U-V)' , as a function of redshift. The color evolution 
from simple passive evolution models that include growth of the 
red sequence by transformation of blue galaxies predict too strong 
a color evolution, espeically the best-fit model to the color scatter 
(dashed line). The color evolution of a single stellar population 
forming at infinity (dashed-dot line) disagrees at high redshift, 
where the mean color flattens between z ~1.5-2. Other mecha- 
nisms such as dust, dry mergers or "frostings" of young stars may 
be important to slow the color evolution. 



from passive evolution given the observed color scatter 
(dashed line in Figure [TO]) . The median visual extinc- 
tion, Ay, from the spectral synthesis modeling described 
in §2 of the quiescent ga l axies in our sample is 0.3 mag. 
Regardless, Krie k etaLl ([20081 ) find that dust does not 
provide a solution as the absolute color evolution for so- 
lar metallicity shows that the z ~2 galaxies are not too 
red, rather the z~0 galaxies are too blue. We note that 
in this paper we simply normalize the colors at z=0.5 
and do not consider the absolute values. We explore the 
effects of dusts on the spectral shapes in more detail in 
Appendix D, finding that dust may be a secondary effect 
on the composite SEDs in Figure [6] 

In addition to dust extinction, a plausible mechanism 
to weaken the color evolution of these quiescent galax- 
ies includes recent low level bursts of star formation. If 
these galaxies have experienced a recent burst of star 
formation, a frosting of young stars should be evident in 
the SEDs of these galaxies. The continuum shape would 
appear to be a composite of an old stellar population 
with a strong 4000A break and a young component visi- 
ble in the rest-frame UV. We see some evidence for this 
amongst a fraction of these quiescent galaxies; roughly 
10% of the highest redshift galaxies have negative slopes 
in the rest-frame UV, parameterized by Fa oc \P . How- 
ever, a frosting of young stars may be too transient a 
feature to effect the mean color evolution. We test how 
ongoing star formation would effect the composite SEDs 
in Appendix D, finding that a more complex star for- 
mation history cannot reproduce the trends of spectral 
shape with U-V color. 

Following the work of lKriek et al.l (|2008f ) , another pos- 



sible mechanism that would slow the color evolution of 
quiescent galaxies is red mergers. With deep observa- 
tions of a sample of nearby b ulge- dominated early - type 
galaxies, Ivan Dokkuml (|2005D and iTal et all (|2009l ) find 
morphological signatures of tidal interactions and infer 
that red mergers may lead to a factor >2 evolution in 
the stellar mass density of luminous red galaxies at z<l. 
Red (dry) major mergers of two red sequence galaxies 
will shift the galaxies towards higher masses for the same 
color and ages. While the mean ages of the stellar pop- 
ulations are unchanged, red mergers will therefore slow 
the color evolution and probably become significant at 
z<l-1.5. It is unclear what the dominant mechanism is 
that slows the color evolution of red sequence galaxies, 
perhaps all of the methods described above are impor- 
tant at different epochs. 

6. SUMMARY AND CONCLUSIONS 

In this work, we have selected a complete, mass- 
selected sample of quiescent galaxies out to z=2.2 
through a selec tion based on extinct ion corrections to 
the U-V colors ([Brammer et al .11200 9). We first demon- 
strate that we can measure accurate rest-frame colors at 
z= 1-2 using the medium-band filters. We find that the 
intrinsic scatter in the rest-frame U-V color of quiescent 
galaxies increases with redshift, measuring a scatter that 
is > 2 times larger than previous cluster measurements 
at z ~ 0.5 (see Table 1 for a data summary). This mea- 
surement implies a potential real diffence between the 
evolution of the scatter in the field and in the cluster 
environment. If confirmed, this may suggest that quies- 
cent cluster galaxies are older than quiescent field galax- 
ies, although the difference would be somewhat larger 
than found previously from the fundamental plane (see 
§3.2). The scatter can be explained by trends of U-V 
with the relative ages of the stellar populations. Those 
galaxies with intrinsically redder U-V colors have older 
stellar populations than those galaxies in the blue tail 
of the distribution. To bolster the argument, we show 
the composite rest-frame SEDs for all galaxies in the 
sample. The rest-frame SEDs highlight the usefulness 
of the NIR medium-bandwidth filters in sampling the 
Balmer/4000A-break region at a higher resolution than 
the tradition broadband filters. The observed SEDs show 
a very clear trend of an emerging spread between quies- 
cent red, old galaxies and quiescent blue, younger quies- 
cent galaxies that exists up to and beyond z~2. 

The main result from our paper is that we find both 
young and old quiescent galaxies at z ~2, leading to 
a relatively large scatter in the U-V colors of massive, 
quiescent galaxies at these redshift s. The presence of a 
large population of young, quiescent galaxies implies that 
galaxies were strongly forming stars shortly before star 
formation ended. As shown in §5, parametarizing the 
star formation history by /*, which describes the lumi- 
nosity weighted formation time of the stars for a model 
that includes progenitor bias and passive evolution, we 



find /*=0.92 



+0.03 
-0.05- 



This star formation history predicts 



massive galaxies with high sSFRs, which are seen at z~2 
in Figure 1. We suspect that these massive, star- forming 
galaxies at z ~2 are the progenitors of massive galaxies 
at lower redshifts and a few may possibly be detected as 
sub-mm sources. 
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TABLE 1 

Data Results Summary 





Number 


Observed Scatter 


Measurement Scatter 


Intrinsic Scatter 


Mean 


Quiescent 


Redshift 


of Galaxies 


cru_ v (mag) 


cru_ v (mag) 


cru_ v (mag) 


(u-vy 


Fraction 


0.2 < z < 0.9 


203 


0.108 


0.050 


0.095 


1.883 


0.91 


0.9 < z < 1.4 


175 


0.153 


0.052 


0.144 


1.726 


0.85 


1.4 < z < 1.8 


138 


0.178 


0.056 


0.168 


1.623 


0.70 


1.8 < z < 2.2 


54 


0.200 


0.069 


0.188 


1.628 


0.38 



The significant population of old galaxies at z ~2 
pushes back the star formation epoch of the oldest mas- 
sive galaxies. Galaxies with dominant 4000 A breaks 
(not post-starburst) comprise ~20% of the total pop- 
ulation of massive galaxies at z ~2. It is interesting to 
speculate whether the z=7-S star-forming galaxies that 
have been found recen t ly in the Hubble U ltra-Deep Field 
(jBouwens et all 120091 : lOesch et all l2010f ) could be the 
progenitors of these galaxies. Their stellar masses are 
10 9-io Mq (|Labbe et al.ll2010D . which may imply that 
substantial growth would be necessary between z=6 and 
z= 2. It may also be that more massive galaxies exist 
than have been found so far in the fields studied at these 
high redshift s. 

Using our high quality SED modeling (including opti- 
cal through IR photometry, medium-band NIR photome- 
try and accurate photometric redshifts), we find galaxies 
that appear to be nearly the age of the universe, even 
at these high redshifts. The oldest galaxies observed in 
this study may have been under-represented in the spec- 
troscopic st u dies of red sequence galaxies at z ~2.3 by 
iKriek et all (|2006l 120081 ). Kriek et al. draw their tar- 
gets from a larger sample of galaxies selected to a lim- 
iting K-band magnitude of 21.6 in the AB magnitude 
system. Because they use a magnitude limit, they may 
have missed the oldest galaxies in the universe. How- 
ever, deep follow-up spectroscopy of one quiescent galaxy 
shows that this galaxy, which was initially classified as a 
post-starburst system, was actually much older and dom- 
inated by a 4000 A break (jKriek et al.ll2009f ). Although 
we have good reason to believe t his galaxy is not repre- 
sentative of the full sample (see IKriek et a l. 2009), this 
case illustrates that caution is required. It is also possi- 
ble that the differences with the Kriek et al. sample may 
simply be due to the slightly different mean redshifts. 
In any case, this work highlights the importance of un- 



derstanding how a simple magnitude limit can introduce 
biases into a galaxy sample. With increasing numbers 
of high-quality surveys becoming public, it is important 
to select samples to some limiting mass to probe the full 
range of properties of galaxies. 

Future studies combining the NMBS with HST WFC3 
imaging will enable more detailed studies of the sizes and 
morphologies of the most massive galaxies at high red- 
shift. It would be interesting if there exists a relationship 
between U-V and the size of these galaxies, where the 
reddest quiescent galaxies may also be the most compact. 
Or perhaps the earliest compact, quiescent galaxies may 
have had time to grow through mergers between z ~3 and 
2 and ar e therefore larger tha n the recently "quenched" 
galaxies (M ancini et al.ll2010f ). Whatever the case may 
be, size measurements of this sample of quiescent galax- 
ies will provide constraints on galaxy evolution scenar- 
ios linking the k nown compact, quiescent systems at 
high r edshift (e.g.Uhliillo et a l. 2006; van Dokkum et al.l 
120081 : iCimatti et al. 2008) to the early-type galaxies in 
the local universe. 
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APPENDIX A. UVJ SELECTION OF QUIESCENT GALAXIES 

In this paper we select galaxies based on their extinction-corrected U-V colors. We sh ow here that this selec- 
tion method is simila r to a selection based on the rest-frame U-V and V-J colors used bv lLabbe et al.l (|2QQ5l ) and 
I Williams et al.l (|2009[ ). This alternative color-selection has been shown to cleanly separate quiescent and star- forming 
galaxies. Two distinct populations exist in the U-V versus V-J color-space; the red, quiescent galaxies tend to lie in 
a clump, while the star- forming galaxies follow a well-separated track that extends from blue to red U-V colors with 
increasing amounts of dust extinction. 

We applied the same selection method as used in I Williams et al.l ([20 09) to separate the star forming and quiescent 
galaxies, independent of the method used in this paper. The resulting sample of massive, quiescent galaxies are very 
similar, the fraction of quiescent galaxies agree within 5% at all redshifts. In Figure 11a, the massive, quiescent galaxies 
do indeed cleanly separate from star forming galaxies in the UVJ color-color plot. A few of the quiescent galaxies 
selected in the UVJ method have large amounts of dust reddening, as seen in Figure lib by plotting the extinction- 
corrected U-V colors as a function of mass. These blue quiescent galaxies will increase the measured intrinsic scatter 
at high redshift while making the mean color at these redshifts bluer by ~0.2-0.4 mag. 

We have repeated the analysis as described in §3 for this new sample of quiescent galaxies. In Figure 12, we show that 
the measured intrinsic scatter of the UVJ sample of quiescent galaxies rises even more steeply at z>l. These trends in 
the color scatter imply even shorter characteristic timescales for star formation as well as luminosity- weighted ages that 
are dominated by star formation close to the end of their star formation histories. This may be due to contamination 
by star- forming galaxies; whatever the cause, we note that our selection is more conservative 

In Figure 13, we show the composite rest-frame SED of the £7VJ-selected sample of quiescent galaxies, derived 
using the same method as described in §3.4. Comparing the distribution of colors from the [TVJ-selection criterion to 
the histograms in the inset panels of Figure [7J we see that the galaxies selected here extend to bluer U-V colors and 
have broader distributions. We also see that the bluest quiescent galaxies from this selection method show stronger 
rest-frame UV emission and there is a more pronounced clump of blue points that lie above the model around 5000- 
6000A in the highest redshift bin. This may be evidence for emission lines in these blue galaxies. Emission lines are 
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Fig. 11. — (Left) The U-V versus V-J color-color diagram for all galaxies in the NMBS sample (greyscale). Those galaxies that are both 
massive (>10 n Mq) and have rest-frame U-V colors within the selection-window of Williams et al. (2009) are show as black, filled circles 
and those galaxies that are selected based on their U-V colors corrected for dust reddening in this paper are red, filled circles. (Right) 
The color-mass diagram for all galax ies in the NMBS sample (greyscale) with the quiescent sample that would be selected following the 
UVJ-method of Williams et al. (2009) in black and the dust extinction corrected color selection in red. 
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Fig. 12. — The observed (green) and intrinsic (black) scatter in the rest-frame U-V colors of the most massive, quiescent galaxies selected 
by their U-V and V-J colors. The scatter due to measurement errors (blue) is removed from the observed scatter. The vertical black error 
bars mark the 68% confidence intervals due to photometric errors and the larger green error bars are systematic errors due to the selection 
method. The dashed line is the expected evolution of the intrinic scatter due to passive evolution for galaxies that started forming stars 
at z=3 with a characteristic timescale for transformation into an early- type galaxy of 1.4 Gyrs, with a large burst of star formation before 
transforming. 
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Fig. 13. — The composite rest-frame SED of massive, quiescent galaxies from z=0.2 to 2.2, selected based on their U-V and V-J colors. 
The reddest and bluest U-V quartiles are color-coded as red and blue, and the solid lines are the median best-fit templates as derived 
from the spectral modeling analysis. The sub-panels in each redshift range are the histograms of (U-V)\ U-V with the slope of the 
color-magnitude relation removed. The total number of galaxies in each bin is labeled in the top left of each panel. There is an even larger 
difference between the composite SEDs of the red and blue quartiles at z~2, compared to our more conservative sample in Figure [6] 



generally too weak to affect the SED modeling, although their signal is (just) strong enough to identify in the average, 
de-redshifted spectrum of star- forming galaxies with the NMBS (Brammer et al. in prep). These quiescent galaxies 
with even bluer colors that enter the sample at z~2 are the the massive galaxies just below our selection limit. Most of 
these galaxies also have MIPS detections and may therefore have some recent star formation or host AGN. In general, 
our selection based on the extinction-corrected U-V colors is in fact more conservative than a [TVJ-selection. 

In summary, a selection of quiescent galaxies based on their U-V and V-J colors yields a very similar sample of 
galaxies. Both methods result in the same fundamental differences between the reddest and bluest quiescent galaxies. 

APPENDIX B. THE EFFECTS OF PHOTOMETRIC SCATTER 

We consider what the rest-frame SEDs would look like if the scatter in color was in fact due solely to photometric 
errors of a single age population. We find the median SED for all quiescent galaxies in the highest redshift bin and use 
this template to derive what the observed flux would be if these galaxies all had the same age but a range of redshifts 
between 1.8 < z < 2.2. Next we perturb the fluxes by increasing factors of the observed error bars and re- measure the 
rest-frame colors until the scatter in the rest-frame U-V color matches our observed scatter of <ttj-v ~0. 15-0. 17. The 
error bars have to be increased by a factor of 8 to yield an observed scatter consistent with our observations. 

Although it is doubtful that our photometric errors are underestimated by a factor of 8, we plot the SEDs of 
all simulated fluxes for a single-age stellar population (black model) with scatter that results from the increased 
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Fig. 14. — (Left) Simulated fluxes of a single age stellar population (black model) with scatter due solely to photometric errors. The 
error bars had to be scaled up by a factor of 8 to measure a scatter in U-V comparable to our results. The red and blue models are the 
median spectral synthesis models of the composite rest-frame SED shown here as points. The reddest and bluest quartiles in U-V are the 
red and blue points, respectively. 

Fig. 15. — (Right) The composite rest-frame SEDs of the red and blue quartiles at 1.8 < z < 2.2, systematically offsetting the photometric 
redshifts by Az= ±0.12, thereby shifting the blue stack to longer wavelengths and the red stack to shorter wavelengths (top panel). The 
black model is the average best-fit model from the blue and red quartiles. The residuals of the observed fluxes with respect to the average 
model is shown in the bottom panel, with the running median for each quartile shown as a dot-dashed line. 



photometric errors in Figure 14 to see if we find similar trends to Figure El We plot all simulated fluxes and indicate 
those galaxies with the reddest U-V colors in red and those with the bluest U-V colors in blue, exactly as we have 
done in Figure [U We see that the median best-fit spectral synthesis model for the bluest quartile does emit slightly 
more radiation at all wavelengths than the reddest quartile blueward of 7000 A (as we have selected them). Although 
the models are well-behaved, the important result is that the observed fluxes are incoherent. These simulations show 
that the trends we observe cannot be the result of photometric errors alone. 

Another concern is that the photometric redshifts suffer from both random and systematic errors. To understand 
the effects of random uncertainties in the photometric redshifts on the composite SEDs, we perturbed the redshifts of 
all quiescent galaxies in the highest redshift bin by their 68% confidence level uncertainties and re-fit the photometry. 
Although some scatter is introduced to the SEDs, we still find that the reddest galaxies emit systematically lower flux 
than the bluest galaxies on the red sequence at all wavelengths < 7000 A. Although our results hold strong against 
random unc ertainties in the ph otometric redshifts, systematic uncertainties may have a more severe effect. 

Recently, iKriek et"aT1 (j2009[ ) took a follow-up deep 30-hour spectra of a massive quiescent galaxy and derived a 
spectroscop i c reds hift of z spe c=2.186. This galaxy was selected from the original sample of 9 quiescent galaxies from 
IKriek et al.l (|2006l ). where the spectroscopic redshift with a 6-hour integration was z spec =2.31. The error in z spec of 
0.12 in redshift space shifted the 4000 A break to shorter wavelengths resulting in a continuum shape that appeared 
to instead have a strong Balmer-break and therefore a younger stellar population. If the photometric redshifts of the 
blue quiescent galaxies have been systematically under-estimated and the photometric redshifts of the oldest galaxies 
have been systematically over-estimated, we could falsely produce the spread of ages found in this work. To test how 
systematic errors would change the results, we assumed that the photometric redshifts of the bluest quartile were all 
under-estimated by Az=0.12 and the reddest quartile were over-estimated by Az=0.12 and re-fit the models. For 
this extreme case, the residuals between the average model of all of the data and the two quartiles overlap in the 
Balmer/4000A break region, however the rest-frame UV and rest-frame NIR show clear differences (see Figure 15). 
If we relax the systematic shifts from Az=0.12 to 0.06, the residuals once again show a clear difference over the 
entire wavelength regime. Although we cannot rule out that systematic effects may significantly affect our results, our 
photometric redshift errors would need to conspire quite drastically to wash out the fundamental differences between 
the SEDs of the reddest and bluest galaxies on the red sequence. 

APPENDIX C. THE EFFECTS OF TEMPLATE SETS ON THE PHOTOMETRIC REDSHIFTS AND 

REST-FRAME COLORS 

In this section, we briefly discuss how the template sets used to fit photometric redshifts and rest-frame colors 
can have important systematic effects. The optimized template set used for the EAZY photometric redshift code 
contains 6 templates, which is large enough to span a broad range of galaxy colors, while minimizing color and redshift 
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Fig. 16. — (Left) The difference in redshift (z S pec-£phot) for photometric redshifts measured using Template Set 1 (which includes an 
additional old, red template, shown in grey) and Template Set 2 (the default set of EAZY templates, shown in black). The photometric 
redshifts of the reddest galaxies are systematically under estimated if the additional old template is not included. (Middle) The difference 
in measured rest-frame U-V colors, (U-V) spe c - (U-V) p h ot , as measured using the photometric redshifts from Template Set 1 (grey) and 
Template Set 2 (black). The systematic offset of ~0.05 in redshift space results in an offset of ~0.1 magnitudes in color when using Template 
Set 2. (Right) The U-V distributions as measured from the spectroscopic redshifts (grey) and the photometric redshifts measured using 
Template Set 1 and 2. When the additional very old template is not included, the color distribution is artifically narrowed, and very peaked 
relative to the broader distribution of colors from the spectroscopic redshifts. 

degeneracies. The default template set is described in detail in iBrammer et al.1 ([200 8): the set includes 5 templates 
generated based on the PEGASE models and calibrated with synthetic photometry from semi-analytic models, as well 
as an additional young and dusty template added to compensate for the lack of extremely dusty galaxies in semi- 
analytic models (we call this "Template Set 2" here). "Template Set 1", which is the templ ate set used in th is paper, 
includ es an additiona l template for a very old galaxy. This old, red template is from the M arastonl ([20 05) models, 
with a iKroupal (|2001[ ) IMF and solar metallicity for a stellar population that has an age of 12.6 Gyrs. 

Figure 16 shows how the additional old (red) template effects both the photometric redshifts measured as well 
as the U-V colors. In the left panel, the difference between the spectroscopic and photometric redshifts is strongly 
correlated with the difference in the U-V colors measured from these respective redshifts. The trend results in a 
systematic uncertainty in the U-V color (as seen by the spread of the A(U-V) distribution in the middle panel) of 0.05 
magnitude^- In addition to this systematic uncertainty, there can be a systematic offset if the template set does not 
span the entire range of colors of the galaxies. In this case, Template Set 2 does not include a red enough template 
and therefore tends to over-estimate the photometric redshifts of the reddest galaxies (the black histogram in the left 
panel is offset). Furthermore, this over-estimate of the redshift leads to colors that are too blue (see the middle panel). 
Finally, if we compare the U-V colors as measured from the spectroscopic redshifts and the photometric redshifts with 
the two template sets, we see that the color distribution is artificially peaked for Template Set 2. In other words, when 
the template set does not span the full range of galaxy colors, the scatter in color will be significantly under-estimated. 

APPENDIX D. THE EFFECTS OF DUST AND STAR FORMATION ON THE COMPOSITE SEDS 

In §3.4, we concluded that the trends of spectral shape with the U-V colors were mainly due to the spread in age 
of the quiescent galaxies. However, if the reddest galaxies are simply dustier versions of the bluest galaxies, we also 
might expect similar trends to result. Here we explore if the features we see in the composite SEDs in Figure [6] result 
not because the reddest quart ile is older than the bluest quart ile, but rather because the reddest galax ies are dustier. 
To tes t this, we take the smoothed best-fit model to the bluest quartile at 1.8 < z < 2.2 and apply the Calz etti et al.l 

(2000) dust extinction law to the spectrum with increasing amounts of dust to see if we can recover the shape of the 
reddest composite spectrum. From Figure 17, we see that for values of Ay ~1.0 mag match the data blueward of 
0.4/im, Ay ~0.5 mag matches the slope between ~0.4-0.7/im and values of Ay ~0.2 mag agree with the IRAC data. 
There is no single extinction that is consistent over the full SED. We know that the red quiescent galaxies at z>1.5 
are ~0.2 magnitudes dustier on average than the blue quiescent galaxies, however it does not appear that the trends 
we see in Figure [6] can be explained by dust alone. With that said, we cannot rule out the possibility that dust has a 
second order effect on the trends of spectral shape with age. 

Along the same lines, we suspect that some of these quiescent galaxies likely have ongoing star formation. The reddest 
quiescent galaxies probably have no recent star formation, whereas the bluest galaxies may have a more complicated 
star formation history that includes a recent burst of star formation. To understand how ongoing star-formation would 
effect the spectral shapes, we add Gaussian smoothed Maraston (2005) models with r=0.1 Gyr, Z , and a Kroupa 

(2001) IMF for a range of ages from 1 Myr to 0.4 Gyr to the composite SED of the reddest quartile. Recent star 
formation will have the largest contribution in the rest-frame UV, so we normalize these spectra to the observed flux 

18 We note that the rest-frame U-V color accuracy has a weak i i i i . , . n no A 

J dependence on mass and luminosity evolution, <0.02 magnitudes. 
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Fig. 17. — (Left) The effect of increasing levels of dust reddening on the blue composite SED, as compared to the red composite SED. 
No value of Ay matches the reddest composite SED over the entire wavelength range. Dust may have some contribution to the trends of 
Figure [6] but they are second order to the trends of U-V with age. 

Fig. 18. — (Right) The effect of ongoing star formation on the red composite SED, as compared to the blue composite SED. We add 
Gaussian smoothed Maraston (2005) models with r=0.1 Gyr, Zq and a Kroupa (2001) IMF for ages of 0.001, 0.1 and 0.4 Gyr normalized 
at 0.2/xm to the red composite SED, re-normalizing at 0.7/xm. A more complex star formation history does not result in a spectral shape 
similar to the blue composite SED; we are able to match the rest-frame UV of the bluest galaxies but not the Balmer-break region. 



of the bluest quartile composite SED at 0.2/im. We add a 0.001, 0.01, 0.1 and 0.4 Gyr stellar population to the reddest 
quartile composite spectrum and re- normalize the spectrum at 7000 A. In Figure 18, we see that Balmer/4000A-break 
region remains effectively unchanged regardless of the age of the young component we add to the red composite SED, 
while the slope in the rest-frame UV steadily rises blueward of ~0.3/im. From this simple analysis, we conclude that 
the difference between the blue and red composite SEDs cannot be the result of ongoing star formation in a subset of 
the quiescent galaxies. This does not mean that the quiescent galaxies have no ongoing star formation, rather that 
the trends we see in Figure [6] are not likely caused by more complex star formation histories. 

APPENDIX E. THE EFFECTS OF METALLICITY VARIATIONS ON THE INTRINSIC U-V SCATTER 

In this paper, we assume that U-V color scatter and variations in SED shapes are due to age variations between 
galaxies. However, metallicity v ariations could als o contribute to the measured intrinsic scatter in U-V color. Here we 
quantify this effect by using the iMarastonl (j20Q5h models to predict how the U-V scatter should evolve with redshift 
if the scatter is caused solely by metallicity variations (and not a spread in age). In the left panel of Figure HU we 
see that red U-V colors result from both older stellar populations and higher metallicities. Furthermore, the scatter 
in metallicity depends on the age of the stellar population. For ages >3 Gyr, the scatter in U-V due to metallicity 
variations is roughly constant, with a smaller scatter for ages <3 Gyr. The relation between U-V and metallicity is 
shown in the middle panel of Figure [191 an d is roughly linear with an age-dependent slope a. To characterize how 
the slope of the color-metallicity relation evolves with the age of the stellar population, we fit the U-V color for 4 
metallicities with a linear function and plot the slope as a function of the age in the right panel of Figure HU 

Given the evolution of the slope of the color-metallicity relation, we can predict how we expect the scatter in U-V 
to evolve with redshift for a given scatter in metallicity. We first assume the age of the stellar population at 2=0, 
which determines the slope of the color-metallicity relation from the right panel of Figure [T9l We will further assume 
that <ttj_v = 0-03 at z = 0, as determined in the Coma cluster ([Bower et al.|[l9 92). Given a and the scatter in U-V 
at z = 0, we can predict the scatter in U-V due to logZ/Z , 

au-y(z = 0) -a(z = 0) 
*v-v{z) = ^ (2) 

The scatter due to metallicity, a\j-y(z = 0) • a(z = 0), is constant for a given age. However, the slope a(z) will change 
as the population evolves, leading to an evolving color scatter. In Figure [20l we find that the scatter in U-V due to 
metallicity variations alone will stay roughly constant for the oldest stellar populations and eventually decrease. This 
trend in <ttj-v is opposite to the measured evolution of the intrinsic scatter in U-V with redshift. We therefore expect 
that metallicity variations will not have a large effect on this work. 
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Fig. 19. — (Left) The U-V color as a function of the age of the stellar population from the Maraston (2005) models for a range of 
metallicities. The U-V colors are redder for higher metallicity and older stellar populations. The scat ter in U-V is rou ghly constant for 
stellar populations older than ~3 Gyr. (Middle) The U-V color as a function of metallicity, from the [Maraston ( 2005) models shown in 
the left panel. The points are fit linearly for 5 different ages (dotted lines), showing that the slope (a) increases with age. (Right) The 
slope of the relation between U-V and metallicity as a function of the age of the stellar population. 



0.20 



0.15 



0.10 



0.05 



0.00 



■ This Work 

"A Bower et al. (1992) (Coma cluster) 
& van Dokkum et ol. (2008) (cluters) >- 



0.0 



0.5 




Evolution in due to 
Metallicity variations 



1.0 

Redshift 



1.5 



2.0 



Fig. 20. — The intrinsic scatter in [/-V as a function of redshift as measured in clusters (Bower et al. 1992; van Dokkum 2008), as well as 
in this work. We compare the measured values to the expected evolution in <7u— v if the color scatter is due solely to metallicity variations 
for a single aged stellar population. Metallicity variations will increase the scatter in U-V for older ages and therefore lower redshifts, 
opposite to the trends found in this work. 



